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A coordlnatc-free  approach  to  linear  prediction  la  developed.  The  results 
extend  those  obtained  by  Wataon  (1972)  and  they  are  applied  here  to  the  catisutlon 
of  randoB  af facta  In  alxcd  linear  aodels.  The  treataant  is  self-contained  and 
gives  e study  of  when  the  BU)  end  LS  predictors  ere  identicsl.  In  pert  11  the 
■ethods  will  be  applied  to  the  closely  related  topic  of  eatiaatlng  Biasing  values 
in  the  analysis  variance. 
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{1  Introduction 

Frobleas  In  linear  Bodcls  or  In  the  analysis  of  variance  Rain  In  clarity  when 
they  are  perceived  as  slaiple  extensions  of  the  old  Pythagorean  theorem.  The 
coordinate-free  approach  to  univariate  linear  aodels  provides  an  imediate  means 
of  access  to  such  a geosMtrlcal  picture,  and  allows  a unified  approach  to  problems 
which  are  usually  thought  of  as  different  or  unrelated  such  as  the  estimation  of 
missing  or  mlxed-up  observations,  or  the  fitting  of  models  when  extra  data  are 
present  in  the  analysis  of  variance.  Indeed  these  problems  formed  the  basis 
of  Kruskal's  (1961)  early  advocacy  of  the  coordinate-free  viet^olnt,  and  the 
purpose  of  these  papers  is  to  show  how  problems  Involving  the  prediction  of 
unobserved  or  unobservable  random  variables  fit  Into  the  sasw  picture. 

The  main  results  of  this  first  paper  are  sho%m  to  include  ones  concerning 
linear  predlctioh  within  a classical  linear  model  framework,  due  to  Watson 
(1972),  as  well  as  others  due  to  Henderson  (1963),  Searle  (1974)  and  Harvllle  (1976) 
relating  to  the  "estimation”  of  random  effects  in  mixed  linear  models.  With  this 
background,  the  estiutlon  of  miaalng  or  mlxed-up  values  In  designed  experiments 
naturally  suggests  Itself,  and  In  the  second  paper  we  prove  that  the  solution  to 
this  problem  Is  identical  to  the  best  predictor  of  the  unobserved  values.  This 
result  extends  and  proves  an  observation  of  Falrf ield-Smith  (19S7). 

We  start  In  section  2 with  some  basic  definitions  and  properties  of  linear 
models  with  a possibly  singular  covariance  matrix.  Our  main  sources  here  are 
Kruskal  (1961,  1968),  Drygas  (1970),  Eaton  (1972)  and  Rao  (1974).  Section  3 gives 
a durivutlun  of  the  bust  linear  unbiased  predivlur  (BLUP)  fur  a vector  of  new 
values.  It  generalises  Watson's  (1972)  result  in  that  the  treatment  span 
(model  subspace)  la  allowed  to  intersect  with  the  null  space  of  the  covariance 
operator  and  its  auin  difference  with  Narvllle's  (1974)  approach  is  our  avoidance 
of  the  Zyaklnd-Martln  class  of  g-inverses.  In  section  4 we  obtain  conditions 
under  which  the  BLUP  is  Identical  to  the  LSP  , and  the  paper  concludes  by 
making  the  connection  with  the  classical  approach  to  prediction  and  with  the 
treatswnt  of  mixed  linear  models. 
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|2  A ceordln«t«-fre«  approach  to  unlvrlate  linear  ■odelw 

Let  2 be  a finite  dlaenalonal  vector  apace  (the  data  space)  of  dlaenslon  n , 
and  7 a linear  aanlfold  of  3 . Tha  apace  3 is  endowed  with  the  inner 
product 


<«.y>  - x*y 

where  x*  denotes  the  transpose  of  the  column  vector  x . Other  inner  products 
will  be  defined  on  3 later  on.  In  particular,  since  the  covariance  operator  will 
be  allowed  to  be  singular,  it  will  be  necessary  to  Introduce  a semi-inner  product, 
that  is,  a nonnegative  inner  product. 

A vector  of  n scalar  observations  is  a realization  of  a 3-valued  random 
vector  and  we  uke  a slight  departure  from  conventional  notations  in  that 
instead  of  using  capital  letters  for  the  observed  values,  lower  case  letters 
will  be  used  for  all  vectors  and  capital  letters  for  linear  operators (or  their 
matrix  representations).  Linear  spaces  will  be  denoted  by  script  letters  with 
the  corresponding  capital  letters  for  the  orthogonal  projections  onto  them  with 
respect  to  defined  above. 

The  expectation  Ey  of  y is  the  unique  vector  t of  3 such  that 
for  all  a t 3 

This  definition  does  not  depend  on  the  choice  of  a particular  inner  product  on 
3 . The  vector  t will  be  assumed  to  lie  in  the  subspace  7 of  3 . 

The  covariance  cov(y)  of  y is  the  unique  linear  operator  V on  .D 
such  that  for  all  a,b  c 3 

covI<a,y>,<b,y>l  - <«.Vb>  . 

Tha  operator  V is  nonnegativa  dafinita  and  synawtrical,  hence  ^a,Vb^  is 
a semi-inaar  product  on  3 t va  will  danota  it  by  ^a,l^y  > ^a,Vb^  . We  have 
varC^a.y^]  ■ ^a.Va^  ■ ^a.a^y  ■ Hally  . Claarly  the  definition  of  V does 
depend  on  Che  inner  product  used. 


r 
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For  any  subspaces  A , £ of  9 and  linear  operator  C , write 
A'*’  - (s  c 9 : ^z,a^>  0 for  all  a c A ) , CA  > {Ca  : a e A)  , 
A-ffia(a-fb:a<A,bc9)  and  A ■*■  £ Is  written  A • when 
A n £ a (0)  . Finally  we  denote  the  kernel  (or  null  space)  of  C by 
X(C)  • (s  t 9 ; Cs  • 0)  and  the  range  of  C by  ^ (C)  « (Cz  : z c 9 ) . We 
now  prove  the  following  results  due  to  Rao  (1974). 

Lemma  2 . 1 Let  y be  a random  vector  with  Cy  € U and  cov(y)  • V . Then 
y belongs  to  J+RiV)  with  probability  one. 

Proof:  We  first  note  that  17  + ^(V)  - (T^n  XfV))'^  , Then,  for  all 
z € 7"  n ;r(v)  we  have 

*<*.y>  ■ <z,T>-  0 ; vat<z,y>«<z,Vz>-  0 . 

Hence  for  all  z c n JffV)  , <z,y>  - 0 with  probability  one.  This  shows 
that  y t with  probability  one  or,  equivalently,  that  y t 17  * R(V) 

with  probability  one.D 

Lemma  2.2  17  + ^ (V)  - 7 • VjT^  . 

Proof:  We  first  show  that  7 n v7^  • (0)  . If  vfb  t 7 , where  T • I - T , tliuii 
llV^^^bll^  ■ <V^^^Tb,V^^^Tb>  ■ <b,TVTb>  - 0 whence  vfb  • 0 . Then  we  have 
7 •fi(fv)  - 7 • fRw  - 7 + R(v)  2 7*  M^(f)  - 7 •^(vt)  . But  since 

rank  (VT)  - rank  (TV)  we  must  have  equality. D 

When  7 • v7^  is  a proper  subspace  of  the  Euclidean  space  of  dimension  n , 
l.e.  when  V is  singular,  we  take  the  data  space  9 to  be  that  subspace  rather 
than  the  whole  space.  Now  every  direct  sun  decomposition  of  a vector  space  defines 
projections  onto  ore  of  the  components  along  the  other  and  in  our  case  we  write 
these  as  P z • x and  P x ■ x,  , where 

7iv7'  * V7"l7  * 
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s ■ >2  • t 3 . «2  ‘ 


i>  the  unique  decoapbeltion  of  s c S)  > 7 • . The  range  apace  and  fcarnel  of 

P and  P are  clearly  7 and  , and  vice  veraa,  reapectlvely, 

31V7"  10^17 

and  on  3 we  have 


+ P 


7iv7"  v7^l7 


- I 


(1) 


Theae  projections  are  directly  related  to  the  projections  ^ of  2 onto 
7 defined  relative  to  the  sesil'lnner  product  **y  (!■)  y)  <=  7 , 

and  <11)  for  all  y t 2)  and  t < 7 . lly  - Pj  yPll^  s II”  - t||^  ; equivalently, 
(11)'  TVPj  ^ ■ TV  . The  following  relation  Is  given  by  Rao  (1974) 

p - 1 - P*  on  3 . (2) 

7|v7^  7-^.v 


Wa  nou  derive  the  best  linear  unbiased  estlnstor  (BLUE)  of  a linear  functional 

^t.x^  , T t7  , where,  without  loss  of  generality,  we  can  restrict  ourselves  to 

coefficients  t c7  , since  for  all  t t 3,  <*,t>  ■ ^T*,t>  . Any  linear 

unbiased  estl«ator  of  of  Che  form  ^s,y^  where  t e 3 Is  such  that 

Tt  ■ t , that  la,  s • t - u for  oosm  u c 7*'  • Such  an  sstiaator  is  the  BLUE  of 

<t,T>  if  it  alnlaiaes  Bt^s.y^  - <t,r>)*  - lit  - uH^  , that  is,  using  the 

definition  above.  If  u ■ P t • Hence  the  BLUE  of  ^ la 

7^,V 

<t  - P t,y>  or,  using  (2), 

7 .V 


7IV7" 


By  letting  t range  over  an  orthononaal  basis  of  7 It  Is  easily  seen  that  the 
BLUE  of  Ey  V T Is  P y , and  this  estimator  Is  uniquely  determined  on  3 . 

7IV7' 


\ 


t 
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When  V le  not  elnguler.  In  wliich  case  s 1.  the  entire  Euclidean  space, 

the  BLUE  E y is  Identical  to  P ,7  , the  orthogonal  projection  of  the 

data  onto  7 v.r.t.  • 

The  following  result  gives  a necessary  and  sufficient  condition  for  the  BLUE 
of  Ey  to  be  Identical  to  the  least  squares  estimator  (LSE)  of  Ey  . The  proof 
is  particularly  simple. 

Lemroa  2.3  The  projection  onto  7 along  vT''  is  identical  to  the  orthogonal 

projection  T onto  7 if  and  only  if  V leaves  7 (or  equivalently  7^  ) 
invariant. 

Proof:  Clearly  if  V leaves  T*'  invariant,  then  P , • P , “ T . 
Conversely,  if  T • P , then  T ■ P ; this  implies  ^(T)»  fi(P  ) 

vri7 

or,  equivalently,  7^  ■ VT*’  .[] 

Problems  of  best  linear  unbiased  estimation  with  singular  covariance  matrices 
were  approached  differently  by  Philoche  (1971)  who  first  defined  an  inner  product 
restricted  on  fi(V)  , ^^Vx,Vy»  ■ ^Vx,y^  which  is  identical  to  <...>  , 

on  ^ when  V is  not  singular.  Philoche  then  defines  the  class  of  (Uuss-Markov 
operators,  that  is,  linear  operators  that  are  the  identity  on  7 and  whose 
restriction  to  fi(V)  coincides  with  the  orthogonal  projection  from  A(V)  onto 
^(V)  n 7 w.r.t.  V ‘ discuasion  is  restricted  to  the  space 

7‘f'  ^(V)  to  which  the  data  belongs  with  probability  one,  we  find  out  tliat  all 
the  operators  in  the  class  defined  by  Philoche  coincide  with  the  unique  operator 

^7IV7^  ‘ 


I 
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13 . tln—T  wdictton 

Atmma  that  «•  do  net  ebeervc  all  of  y , but  only  a “part*'  y^  of  it  and 
wa  want  to  uaa  tha  obaarwatleaa  y^  to  predict  y . Mora  praciaaly.  auppoca 
that  S>  ia  the  direct  auai  of  two  orthogonal  adbapaces 


2 . 9,  • a. 


reflecting  the  deconpoaltlon 


»1  * ^2 


of  the  full  data  vector  y Into  the  obaerved  part  • D^y  and  the  unobaerved 
part  yj  * D^y  • With  this  notation  our  aodel  becomes 


Eyi  t 7^ 

(3) 

cov  (y^)  - 1 • 1,  2 

(4) 

tov  (yj,y2>  * ’ 

(b) 

where  7^  ~ 0^7  are  the  aodel  subspaces  for  the  observed  and  unobserved  data 
respectively. 

The  problem  is  to  find  the  best  linear  unbiased  predictor  (BLUP)  of 
(or  equivalently  of  y ) based  on  the  realised  values  of  y^  , l.e.  to  find  A 
such  that  EllAyj  - y^ll^  is  minimum  subject  to  EAyj^  - Eyj  • Our  approach  to 
find  the  Bill?  of  y^  is  as  follows:  (1)  we  form  an  estimate  of  the  mean 


h • “l^ 

of 

>1  ‘ 

(li)  then  we  use  this  to 

get  an  estiimte 

of 

the 

mean 

tj  - 

of 

yj  i 

(ill)  finally  we  use  the 

deviation 

^i  - h 

of 

^1 

from 

a 

its  estimated 

•«an 

to  give  an  indication  of 

yj  - . 

The  BLUr 

of 

^2 

will 

then  be  of  the  form  e C(y^  - t^)  • 


I 


t 


We  will  need  the  following  two  leiniag.  Tlic  first  lemu  Is  a general  result 
on  synnetrlc  nonnegative  operators  for  which  a proof  can  be  found  In  Eaton  (1972, 
proposition  3.31).  The  aecond  leou  is  an  extension  of  proposition  4.3  In  Eaton 
(1972)  to  the  case  of  a general  nonnegative  covariance  operator. 

Lenaa  3.1  ^(D^VDj)  c ft(DjVDj^)  aid  sysnetrlcally  ^(DjVDj)  c fifD^VD^)  . 

Lenma  3.2  If  x Is  a randoa  vector  with  Ez  e A and  nonnegative  definite 
cov  (z)  • W , and  If  F Is  a linear  operator  from  A to  a linear  space  6 , 
then  Che  BLUE  of  FCz  la  F^  where  ^ Is  the  BLUE  of  Ez  . 


We  now  make  Che  following  Important  assumption: 

dlm^j^  • dim?  (6) 

A consequence  of  (6)  Is  that  for  every  ''j  * • f^ere  Is  a unique  x • < ? 

such  that  • x^  . This  correspoxtdextee  Is  clearly  linear  so  we  can  define  an 

operator  M 2 such  that  Md  • 0 If  d c 3^  e axid  Dj^Hx^  ■ x^  for  all 

Xj^  e . Now  by  lesmia  2.1  y^  Is  restricted  to  Che  linear  space 

with  probability  one  and  so  the  unique  BLUE  of  Ey,  ■ T,  Is  x,  • P y,  . 

^ ^ ^ S^KU^VD^)?^  ^ 

It  follows  from  lemma  3.2  that  Hx^  Is  Che  BLUE  of  Ey  ■ r using  y^  , and  that 

D2Mx^  Is  the  BLUE  x^  of  Ey2  * D2''  based  on  the  observed  data  y^  . 

With  these  preliminaries  we  can  now  give  our  theorem,  the  assumptions  and 
notations  being  as  In  (3),  (4),  (5)  and  (6). 


Theorem  3.1  The  BLUP  of  y,  based  on  y,  has  the  form 


92  ■ ^2  * 


where  J C '4  1*  * eywMctric  g-lnverse  of  D.VD.  « ^ f F , t,  Is  Che  BLUE 

i-1  ^ ^ ^ 1-0  ^ ^ 

of  Ey^  end  ^2  " ^y2  * 
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Proof:  For  any  predictor  of  the  fora  Ay^  the  unbiaacdneaa  condition  SAy^  * ^^2 
is  equivalent  to  At^  • ' ^2^1  * 

the  following  decoaposltlon,  where  - 1 - : 

Ayi  “ ATjyj  ♦ AT^y^  . 

or 

Ayi  - + KiPj  • (8) 

Further,  it  can  be  assuaed  without  loss  of  generality  that 

«(B*)  c /?(Ij(D^VDj)Tj)  ; (9) 

indeed  STj^y^^  ■ 0 and  cov  (T^y^)  “ » hence  by  lenma  2.1 

fjy^  £ «{Tj(DjVD^)T^)  . 

2 

Now  the  mean  square  error  E||Ay^  - y^ll  can  be  expanded  as 
EIKDjMTjDj  + BTjDj  - D2)yll^  - 

tr  LD2Mt^(Dj^VDj)TjM*U2J  + 2 tr  [D2MTj(DjVDj)TjB*]+  tr  [BTj(DjVDj)TjB»J 
+ tr  LDjVDjJ  - 2 tr  [DjKT^DjVDj]  - 2 tr  tBT^D^VDj]  . 

A standard  arguMnt  using  derivatives  or  coapletlng  the  square  shows  that  B 
corresponding  to  the  ainiaua  aean  square  error  satisfies 

Bfj(DjVDj)Tj  - »2''®1^1  ■ 02*‘^i<DiVDj)Tj 

and  hence,  using  (9), 

_ P 1 

B - (DjVDjj  - 

P -1  _ _ P 

where  { *4*B.  is  the  syaaetric  g-inverse  of  T. (D  VD. )T  ■ I e.E  in 

j»i 

spectral  fora,  with  e^  • 0 . 


I 


i 


I 


k 


Substituting  B into  (8)  gives,  sfter  regrouping  terns. 


Letting  have  spectral  decooposltlon 

k 

^ *l^i  • ^0  “ ° • 
i-0 


then  by  lensna  3.1  Z F.  “f  I identity  on  ^(D,VD_)  , 

i»l  ^ 'i»l  * V 1 i 

D^VDi  - I>2VD^(  . 

By  checking  that  J ^ satisfies  conditions  (i)  and  (ii 


conditions  (i)  and  (ii)' 


(section  2)  we  see  that  it  coincides  with  P , and  so  by  using  (2)  we  have 


("d-Vo  VY  I - 1 - P 

\ ^ V l\j.i  i V ^ U^\(VjVD^'P{ 

Hence,  by  substituting  (11)  and  (12)  into  (10),  we  have 

5,  - D m p y + (D.VD.VrfYFJl  y,  - 

and,  since  T,  is  the  identity  on  V,  and  P — ,y,  “ i,  is 

Ey^  , the  proof  is  conplete.L) 


the  BLUE  of 
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i4.  Wh«n  »re  th«  BLUP  «nd  the  LSP  Identical? 

It  la  aasy  to  verify  that  the  leaat  aquares  predictor  (LSP)  of  baaed 
on  y^  is 

^2  ■ 

that  is,  the  LSP  of  y^  is  identical  to  the  LS  estimator  of  Ey^  based  on  y^  . 
This  expression  is  much  simpler  to  evaluate  than  the  expression  for  the  BLUP. 
Moreover  the  LSP  does  not  involve  the  usually  unknown  covariance  operator  V . 

It  is  thus  of  interest  to  look  for  conditions  under  which  the  BLUP  can  be  replaced 
by  the  simpler  expression  for  the  LSP.  The  following  result  generalizes  a theoron 
proved  by  Vlatson  (^72);  the  framework  at  the  beginning  of  section  3 continues  to 
apply  and  7^  is  assumed  to  be  a fixed  subspace  of  2^  . 

Theorem  4.1  For  every  7 c 0 such  that  D^7  “ 7^  and  dim  7 “ dim7j^  , the 
LSP  of  y2  t ^2  coincides  with  the  BLUP  of  y^  if  and  only  if 

(1) 

(11)  ^(D2VD2)  c 7^  . 

Proof:  First  assume  that  (1)  and  (ii)  hold.  By  lemna  2.3  P Pi  * T.y 

7j|(DiVDi)7J  ^ 

and  hence  " 3^2  • •’F  using  the  expression  of  y2  given  in  (10)  it  is  easy 

to  see  that  the  second  term  is  zero  if  (il)  holds.  Conversely,  assume  that  for 
any  7 such  that  din  7 ■ dim72  we  have 

DiKtiyi  - ' h) ' 

Since  this  identity  must  hold  in  particular  for  7 • 7^  • that  is  for  • (0]  , 
we  must  have  (a)  (b)  (D2™i><^*i^fi><yi  - • 0 . 


r 
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Relation  (a)  auac  hold  In  particular  for  Buch  that  0x^2  “ dim 7 and  so 

since  M la  bijective.  This  implies  Chat  (D^VD^)7^  c 7j^  by  lemma  4.1. 
Now  using  the  expression  of  the  second  term  in  (10)  and  the  fact  chat  ^ ^ 

we  can  conclude  that  7^  c jr(D2VD^)  or  equivalently  that  ^(D^VD^)  c . This 
completes  the  proof .□ 


r 
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15.  Applications 

To  (how  how  our  roaults  relate  to  elaesleel  reaults  let  us  specify  a basis 
in  Sf  such  that  vectors  are  arrays  of  coordinates  in  that  basis,  and  linear 
operators  are  given  a aatrix  representation. 

Watson  (1972)  considered  a linear  nodel 

yi  ■ X^B  + 

where  y^  and  ace  n « 1 random  vectors,  is  a known  and  possibly 
singular  n » k design  matrix,  B is  a k « 1 vector  of  unknown  parameters, 
Ec^  • 0 and  var  • f , a nonnegative  symmetric  matrix.  The  problem  was  to 
find  the  BLUP  of 


y^  • x*6  + Ej 


where  the  column  vector  X2  belongs  to  the  column  space  C(X^)  of  X*  (the 

2 

row  space  of  X^  ) , var  Ej  " <>  >0  and  cov  (y^.Xj)  * Y . These  equations 

can  be  expressed  in  the  following  way: 


‘o' 

‘0  0 . . 0” 

‘o' 

^1 

■ 

*1 

B ♦ 

*1 

9 

0 

m 

0 ...  0 

B + 

0 

0 

0 ...  0 

0 

»2 

*2 

Letting  y*  ■ (yj  y^)  , X*  - (X^  s|)  end  c*  > (c|  t|)  we  can  write  ti.esc  as 

y - XB  ♦ e 

where  in  the  notation  of  section  3,  By  c C(X)  ■ 7 , nnd 
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The  condition  X2  e C(X^) 
It  it  easy  to  aee  that  If 

Implies  that 


is  equivalent  to  the  assumption  siade  that  disi?^  ~ dim?  • 

r has  spectral  decomposition  | g G.  , then  theorem  3.1 

1-0  ^ *• 


?2-x56  + Y‘(j^g;\)(y,  -Xid) 


where  0 is  a BUIE  of  0 . This  expression  for  the  BUIP  was  obtsined  by  Watson 
(1972)  under  the  assuaq>tion  that  ^(X^)  c C({)  , but  it  can  be  seen  from  theorem  3.1 
that  this  hypothasis  is  not  indispensable. 


Other  corollaries  of  thaoran  3.1  Include  (i)  the  fact  that  the  prediction  of 
a vector  of  new  values  is  equivalent  to  the  aeparatc  predlctiona  of  each  cosq>oncnt 
of  the  vector,  that  is,  the  BUIP  of  an  r-disMnsional  vector  is  the  vector  of  BLUP's 
of  each  coordinate;  (il)  a best  predictor  of  the  values  of  individual  observations 
when  only  the  stm  (or  Che  values  of  sosw  linear  combination)  of  these  observations 
has  been  observed;  (Hi)  an  alternative  derivation  of  results  concerning  estimation 
in  mixed  models.  He  now  consider  this  last  topic  in  greater  detail. 
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ProblMu  Involving  prndictlon  in  alxed  aodels  have  been  laplicit  In  the 
literature  on  anlaal  and  plant  breeding  for  aany  years,  see  especially 
Henderson  (1963) . and  for  aore  recent  results  Searle  (1974),  Henderson  (1975) 
and  Harvllle  (1976)  • These  aodels  are  usually  fomulated  as 

y ■ xe  + Zu  + e 

where  y la  a vector  of  n observations,  X a known  n » p aatrlx,  Z a known 
n « q aatrlx,  6 a vector  of  p unknown  parameters  ("fixed  effects")  and  u a 
vector  of  q randoa  variables  ("random  effects") . It  is  usually  assumed  that 
Su  - Ec  » cov  (u,t)  > 0 , cov  (c)  • r (non,  n.n.d.)  and  cov  (u)  - A (q  * q , n.n.d. 
The  problea  is  then  to  "estlaate"  a linear  coablnation 

r » k*6  + n*u  , k t ^(X*)  . 

Clearly  the  alxed  model  can  be  reformulated  as 

Ey  c C(X)  , COV  (y)  « 2AZ*  + I 

And  th«  estimation"  problem  is  simply  that  of  predicting  a random  variable  z 
whose  mean  and  variance  are  related  to  that  of  y : the  BLUE  of  Ez  can  be  deduced 
Inaedlately  from  the  BLUE  of  Ey  , var  z • n*Am  and  cov  (y,t)  " ZAn  . By  using  the 
above  correspondence  between  the  classical  and  coordinate-free  approaches,  theorem 
3.1  allows  us  to  express  the  BLUP  of  s as 

(BLUE  of  / effective  \ / residuals  \ 

based  on  1 + cov  (s,y)  « j inverse  j » | of  observed  ] 

BLUE  of  Ey y ^ of  cov  (y)  f ^ modul 

l.e.  £ - k*6  + (n*AZ*)^  J 

J 

where  B is  a BLUE  of  6 and  J B^C.  is  the  spectral  form  of 

i-0  ‘ ^ 


cov  (y)  . 


Ha  thus  see  that  the  sstiaatlon  of  randoai  effects  in  Mixed  models  is  not 
a sui  generis  problsa;  it  can  be  incorporated  within  the  class  of  probleas  of 
prediction  of  an  unobservable  randoa  variable  from  a realised  value  of  one  that 
can  be  observed. 

He  close  this  section  by  aaphasising  three  points  in  the  preceding  discussion. 
Firstly,  we  always  naad  a comdition  to  Insure  that  the  aean  of  the  unobserved 
variable  can  be  astiaated.  Secondly,  as  ahould  be  clear,  when  the  observed  and 
unobserved  quantities  are  uncorrelated,  the  latter  quantity  is  predicted  by  the 
BLUE  of  its  aean.  And  finally  we  remark  that  a singular  design  matrix  causes  no 
problem  in  a geometrical  approach,  nor  does  the  singularity  of  the  covariance 
matrix  provided  discussion  is  restricted  to  the  subspace  in  which  the  data  falls 
with  probability  one. 

Acknowledgenents:  I am  grateful  to  Dr  C.S.  Watson  for  bringing  the  topic 
of  prediction  to  my  attention  and  to  both  him  and  Dr  T.P.  Speed  for  useful 
discussions  on  the  subject. 
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